Value functions for depth-limited solving in zero-sum imperfect-information games
نویسندگان
چکیده
We provide a formal definition of depth-limited games together with an accessible and rigorous explanation the underlying concepts, both which were previously missing in imperfect-information games. The works for arbitrary (perfect recall) extensive-form game is not tied to any specific game-solving algorithm. Moreover, this framework unifies significantly extends three approaches solving that existed multiagent reinforcement learning but known be compatible. A key ingredient these value functions. Focusing on two-player zero-sum games, we show how obtain optimal functions prove public information provides necessary sufficient context computing them. domain-independent encoding domains allows approximating even by simple feed-forward neural networks, are then able generalize unseen parts game. use resulting network implement version counterfactual regret minimization. In distinct domains, algorithm's exploitability roughly linearly dependent network's quality it difficult train CFR's performance as good CFR access full
منابع مشابه
Iterative Algorithm for Solving Two-player Zero-sum Extensive-form Games with Imperfect Information
We develop and evaluate a new exact algorithm for finding Nash equilibria of two-player zero-sum extensive-form games with imperfect information. Our approach is based on the sequenceform representation of the game, and uses an algorithmic framework of double-oracle methods that have been used successfully in other classes of games. The algorithm uses an iterative decomposition, solving restric...
متن کاملSolving two-person zero-sum repeated games of incomplete information
In repeated games with incomplete information, rational agents must carefully weigh the tradeoffs of advantageously exploiting their information to achieve a short-term gain versus carefully concealing their information so as not to give up a long-term informed advantage. The theory of infinitelyrepeated two-player zero-sum games with incomplete information has been carefully studied, beginning...
متن کاملLimited Lookahead in Imperfect-Information Games
Limited lookahead has been studied for decades in complete-information games. We initiate a new direction via two simultaneous deviation points: generalization to incomplete-information games and a game-theoretic approach. We study how one should act when facing an opponent whose lookahead is limited. We study this for opponents that differ based on their lookahead depth, based on whether they,...
متن کاملHustling in Repeated Zero-Sum Games with Imperfect Execution
We study repeated games in which players have imperfect execution skill and one player’s true skill is not common knowledge. In these settings the possibility arises of a player “hustling”, or pretending to have lower execution skill than they actually have. Focusing on repeated zero-sum games, we provide a hustle-proof strategy; this strategy maximizes a player’s payoff, regardless of the true...
متن کاملEndgame Solving in Large Imperfect-Information Games
The leading approach for computing strong game-theoretic strategies in large imperfect-information games is to first solve an abstracted version of the game offline, then perform a table lookup during game play. We consider a modification to this approach where we solve the portion of the game that we have actually reached in real time to a greater degree of accuracy than in the initial computa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial Intelligence
سال: 2023
ISSN: ['2633-1403']
DOI: https://doi.org/10.1016/j.artint.2022.103805